CALCULATING DETERMINANTS OF BLOCK MATRICES 



PHILIP D. POWELL" 1 " 

Abstract. This paper presents a method for expressing the determinant of an N X N complex 
block matrix in terms of its constituent blocks. The result allows one to reduce the determinant of a 
matrix with N 2 blocks to the product of the determinants of N distinct combinations of single blocks. 
This procedure proves useful in the analytic description of physical systems with multiple discrete 
variables, as it provides a systematic method for evaluating determinants which might otherwise be 
^—■^ ■ analytically intractable. 
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\Q ■ 1. Introduction. Block matrices are ubiquitous in physics and applied mathe- 

matics, appearing naturally in the description of systems with multiple discrete vari- 
ables (e.g., quantum spin, quark color and flavor) p][2j[3]. ^ n addition, block matrices 
are exploited in many computational methods familiar to researchers of fluid dynam- 
ics [H [S]. The need to calculate determinants of these matrices is almost equally 
widespread, for both analytical and numerical applications [6l [?]■ For example, a 
■ model of high density quark matter must include color (3), flavor (2-6), and Dirac (4) 

indices, giving rise to a matrix between size 24 x 24 and 72 x 72, with any additional 
properties enlarging the matrix further [1 , 8 . 

The problem of calculating the determinant of a 2 x 2 block matrix has been long 
studied, and is a most important case, since it can be extended to any larger matrix 
in the same way that the determinant of an arbitrary square matrix can be expressed 
in terms of the determinants of 2 x 2 matrices, via minor expansion [9]. The solution 
| to this problem can be obtained by considering the equation 
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where A, B, C, and D are k x k, k x (N - k), (N - k) x k, and (N - k) X (N - k) 
matrices respectively, I and are the identity and zero matrix respectively (taken to 
be of the appropriate dimension), and we have assumed that D is invertible. If D" 1 
does not exist, Eq. (|l.lj) can be rewritten in terms of A -1 , with the right side lower- 
triangular [10]. If neither inverse exists, the notion of generalized inverses must be 
employed [TTJ [T21 H3] ■ We do not consider these complications in the present analysis, 
but rather simply assume the existence of all inverses that arise in the following 
calculations. In practice, this assumption is of nearly no consequence, since the blocks 
will generally be functions of at least one continuous variable (e.g., momentum), and 
any singularities will be isolated points of measure zero. 
Taking the determinant of Eq. (jl.ip yields the result [5] 

d<3t (c d) = det ( A - BErlc ) det ( D )> (1-2) 

where we have exploited the fact that the determinant of a block tringular matrix is 
the product of the determinants of its diagonal blocks. The matrices appearing on 
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the right side of Eq. (|1.2[) are the (N — k) x (N — k) matrix D, and the k x k Schiir 
complement with respect to D [T?J [T5] . 

While Eq. (|1.2p has proven immensely useful, there remain many applications 
in which this result is not ideally suited. For example, there are situations in which 
physics dictates that we partition a matrix in a form different than in Eq. 
The description of intermediate-energy quark matter in quantum chromodynamics, 
for instance, involves propagators which possess four Dirac indices, three color indices 
and three flavor indices. While one could, in principle, partition the resulting 36 x 36 
matrix as shown in Eq. (|1.2[) , such a partitioning would treat the Dirac indices 
asymmetrically, eliminating two of the indices while retaining two others. While such 
a division is of no consequence in computational applications, it is quite undesirable 
in analytical descriptions. Thus, we are led to desire a generalization of Eq. (|1.2I) 
which holds for a block matrix of more general partitioning. 

The remainder of this paper is organized as follows. In Sec. [21 a direct, "brute 
force" calculation of the determinant of an N x N complex block matrix is presented, 
which manifests clearly the emergence of a Schiir complement structure. In Sec. |3l an 
alternate proof of our result is given, which is somewhat more elegant, but perhaps 
less intuitive, by means of induction on N. Finally, in Sec. IU some applications of 
our result are discussed, including the cases N = 2 and N = 3, as well as a 48 x 48 
matrix with N — 6, which arises in the study of quark matter. 

2. Direct Calculation . 

Theorem 2.1. Let S be an (nN) x (yiN) complex matrix, which is partitioned 
into N 2 blocks, each of size n x n: 
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S12 
S22 



_SjVl SjV2 

Then the determinant of S is given by 
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Proof. In order to facilitate computing the determinant of S, we define a lower 
triangular auxiliary matrix 



U = 



I 

U 2 i I 
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Forming the product SU gives 

Sll + Si 2 U 2 i + • • • SiatUati S12 + S13U32 + • • • SiArUAT 2 • • • Sijv 
S2I + S22U21 + • • ■ S27vUtvi S22 + S23U32 + • • • S2AtUtv2 1 ' ' &2N 

Sjvi + Sat2U 2 i + • ■ • SnnXJni Sn2 + Sat 3 U 3 2 + • • • SnnUn2 ■ ■ ■ Sjvjv 

(2.6) 

Now, let us assume that the blocks of U can be chosen such that the product SU is 
upper triangular. In this case, we obtain (N 2 — N)/2 constraint equations (the lower 
triangular blocks of SU = 0) with (N 2 ~ N)/2 unknowns (the blocks of U). The 
N — k equations arising from the A;th column of SU are 

Sfc+l,fc + Sk+l,k+l^Jk+l,k + • • • + Sfc+^jvUjvfc = 0, 

Sfc+2,fc + Sfe +2 ,fe+iUfc + i : / £ + • • • + Sk+2, N^Nk = 0, (2.7) 



&N,k + SjV,fc+lUfc+l,fc + ' ' ' + SatjvUjv/s — 0. 

In order to simplify our notation, we now define the block vectors Sy (Eq. 12. 40 ) and 



Ui,- = I Ua U 



U 



(2.8) 



which represent the portions of the A;th columns of S and U, respectively, which lie 
below, and inclusive of, the block with indices We also let S& represent the 

k x k block matrix formed from the lower- right corner of S, 



&N-k+2,N-k+l 



SjV-fc+l,JV— fc+2 
SjV-fc+2,jV— &+2 



Sjv— fc+i,jv 

&N-k+2,N 



&N,N-k+l &N,N-k+2 ■ ■ ■ &N,N 

With these definitions, we can rewrite Eqs. (|2.7p in the matrix form: 

SjV-/5 U fe+l,fc = — s fc+i,fc- 

Solving for the auxiliary block vector yields 

Ufc+i,fc = —S N _ k Sk+i,k- 



(2.9) 



(2.10) 
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We now define cr^ as the block row vector of S lying to the right, and inclusive of, 
the position (Eq. (|2.4p ). Inspecting Eq. (|2.6[) . we can express the kth diagonal 
element of SU in the form 



(SU)fcfe — Sfefe + <r^ fe+1 Ufc + i : fc, 

= S fcfc - cr^ i . +1 S JV :1 _ fc S/s_|_i ) i : , 
= Sjv_A,+i/Sfcfc, 



(2.12) 



where Sjv-fc+i/Sfefc denotes the Schiir complement of Sjv-fc+i with respect to 



(k) 

Next, defining the matrices a- as shown in Eq 



we write (SU)fcfc = a 
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Since SU is upper triangular by design, the determinant is simply the product of the 
determinants of its diagonal blocks. This, together with the fact that det(U) = 1 



gives 



N 



det(S) = Y[ det(a 



(N-k) 
kk 



(2.13) 



k=l 



In order to express det(S) in terms of the blocks of S, we must now evaluate S fc , 
which appears in Eq. (|2.3p . However, rather than approaching this problem directly, 

(k) 

we will focus on the matrices a\j , and seek to find a recursive relationship between 
matrices of consecutive values of k. Thus, we begin by writing 
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where we have partitioned the block vectors and matrix into sections of block length 1 
and N. Next, we evaluate the inverse matrix above by making use of the Banachiewic 
identity QH [T7] 



[A Bp 1 _ 



(A-BD^C)- 1 
-D _1 C(A-BD _1 C)" 
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Identifying this expression with the partitioned form of S k in Eq. (|2.14l) . the Schiir 
complement with respect to S^r-fe N-k becomes 



A — BD *C = SjV-fc,iV-fc — fW-fe.JV-fc+l^fe ls N-k+l,N-k 
~ "jV-fc.AT-fc- 

Thus, evaluating the inverse matrix in Eq. (|2.14|) . we have 
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(fc) • 

We can therefore express the ol\- in the recursive form: 



a (0) -S- 



,(k+l) _ (k) _ (k) ( (fc) \-l a W 
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Given this recursive relationship, the matrix a^p can be read off directly from S, and 

all higher order a$ can be calculated iteratively. Finally, the determinant may be 
computed via Eq. |2"TT5]) . □ 



3. Proof by Induction . Rather than calculating the determinant of S directly, 
as in the prior section, here we begin by establishing a recursive relationship between 
a sort of generalized Schiir complement of S. To this end, we first prove the following 
lemma. 

Lemma 3.1. Let S be a complex block matrix of the form 



Sn Sl2 

S21 S22 



Sni S 
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(3.1) 



and let us define the set of block matrices {a.(°\ a^ 1 ', ■ ■ ■ , a^ N where aW is an 
(N — k) x [N — fc) block matrix with blocks 



a (0) -S" 
n (k+l) (fc) _(fc) 
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Then the determinants of consecutive a^ k ' are related via 



det(a<*>) = det(a( fe+1 )) det(a^ kN _ k ). 
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and let us denote the upper-left block of oS k ^ by cS k \ The Schiir complement of 
with respect to is then 
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Applying the 2x2 block identity (Eq. fll.2[) ). we can therefore write the determinant 
of or*) as 



det(a^) =det(a^ +1 0det(a 
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Theorem 3.2. Given a complex block matrix of the form S2.1\) . and the matrices 
a£j defined in Eq. \3.S\) . the determinant of S is given by 



det(S) = Y[ det(a 



(JV-fc)^ 

fcfc ; 
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Proof. Recall from the definition in Eq. (13.21) that a' ' = S. Starting with det(S) 
and applying Eq. (|3.6p repeatedly yields 

det(S) =det(a^), 

= det(aW)det(a$ v ), 



det(a^) det(a^_ 1 N _J det(a^), 



.,(!) 



(0) 



; (3.8) 

= det(a (JV - 1} ) det(a^" 2) ) • • • det(a^), (3.9) 

where the sequence of equalities terminates because a^ -1 ) is a 1 x 1 block matrix, 
which cannot be partitioned further. Thus, writing a^^ 1 ^ = ^ we obtain the 
result 
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4. Applications . Having derived an expression for the determinant of an N x N 
complex block matrix, it will be useful to examine the result for a few specific values 
of N. We choose to consider N = 2 and N — 3, as the first is the well-known result 
of Eq. (|1.2p and the second gives a clear picture of what sort of objects our result 
actually involves. Larger values of N quickly become cumbersome to write down, 
but the procedure for their calculation will be made clear. Lastly, we present a "real 
world" application of our result by calculating the determinant of a 48 x 48 matrix, 
which arises in the study of quark matter. 

4.1. 2x2 Block Matrices. In the case N = 2, Eq. (j2~T3"f reduces to 

dct(S) = det(a^) det(a 2 ° 2 ) ), (4.1) 

while Eqs. (I2.18|) reduce to 

ag^Sy ag } = S 4 ,-S t2 S 22 1 S 21 . (4.2) 
Thus, we obtain the result 

det(S) = det(S u - SiaS^Sai) dct(S 22 ). (4.3) 

Note that this expression is identical to Eq. (jl.lj) . It is instructive to combine the 
two determinants in this expression and rewrite it in the alternate forms 

dct(S) = dct(SnS 22 - Si 2 S 22 1 S 2 iS 22 ), 

= dct(S 22 S n - S 2 2Si2S2 2 1 S 2 i). (4.4) 

In these forms, it is clear that when either Si 2 or S 2 i commute with S 22 our expression 
reduces to: 

Si 2 S 22 = S 22 Si 2 : det(S) = det(S 22 Sn — Si 2 S 2 i), 

S 2 iS 22 = S 2 iS 2 i : det(S) = det(SnS 2 2 — Si2S 2 i). (4.5) 

Alternatively, for anti-commuting matrices, which often arise in the study of fermionic 
systems, the sign in (|4.5j) becomes positive. 

4.2. 3x3 Block Matrices. In the case N = 3, Eq. (|2~T3l reduces to 

det(S) = det(af 1 ) ) det(a 2 1 2 ) ) det(ag ) ), (4.6) 
while Eqs. (|2.18|) reduce to 

a< if = ajj = Sij - Si3S 3 3 1 S3i, 

a ij = ~ SisS^Ssj] 

— [S,2 — S i3 S 3 3 1 S3 2 ] [S 22 — S 2 3S 33 1 S3 2 ] [S 2 j — S 2 3S 33 1 S3 : ,] . 



Thus, we obtain the result 



(4.7) 



det(S) = det [S n - S^S^SsiJ 

— [S12 — Si3S 33 1 S3 2 ] [S 22 — S 2 3S 33 S 32 ] [S 2 i — S 23 S 33 S3i] 

x det (S 22 - S 23 Si 3 1 S32) det(S 33 ). (4.8) 
Analogously to the N = 2 case, the commutation of certain blocks (e.g., S33 and S3.,, 
with 0^2) allows this expression to be simplified. 
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4.3. Eigenenergies of high-density quark matter. Having considered the 
general form of the determinant of 2 x 2 and 3x3 block matrices, we now consider a 
true application of our result by calculating the eigenenergies of quark matter in the 
two flavor Nambu-Jona-Lasinio model [3J [THJ HH] • In this model, the energies are the 
roots of the equation det S = 0, where 



t + M7° - M 
-A*7 5 t 2 A 2 



A75T2A2 
- ^7° - M 



(4.9) 



and where M is the effective quark mass, /i is the chemical potential, A is the pairing 
gap, $ = Ej° — 7 • k, where k = (k x ,k y ,k z ) T is the quark momentum and the 7" 
(v = 0...3) are the 4x4 Dirac matrices: 



7° = 



7 = 





-CTi 







with er = (a x , a y , a z ) T representing the vector of Pauli matrices: 



1 

1 







1 

-1 



(4.10) 



(4.11) 



In addition, 75 = i7°7 1 7 2 7 3 , t 2 is the second Pauli matrix in flavor space, and A 2 is 
the second Gell-Mann matrix in color space: 



-i 

1 




(4.12) 



Thus, while we have written Eq. (|4.9|) in 2 x 2 block form, each block is itself a 24 x 24 
matrix (2 (flavor) x 3 (color) x 4 (Dirac)). 

Before constructing the determinant from Eq. (|2. 13|) . we must choose how we 
wish to partition S. We could, for instance, choose to begin with the 2x2 block form 
shown in Eq. (|4.9[) , in which case the would be 24 x 24 matrices, which we would 
partition further, repeating the process until we have eliminated all indices. While 
this choice has the advantage of requiring the construction of only a single (a^) 
for the first step (Eq. (|4.1|) ). the price is that correspondingly more steps are required 
to finally obtain det S. 

As a middle ground, balancing the number of a'^'s which must be constructed 
in each step with the number of steps, we will partition S into a 6 x 6 block matrix, 
with each block of size 8x8. We achieve this partitioning by writing out the color 
indices explicitly, while leaving the Dirac and flavor indices intact. Thus, the non-zero 
blocks of S become 



Su = S 22 = S33 = f + M7° - M, 
S 44 = S 55 = S 66 = f - M7 ° - M, 
S 24 = -S15 = «A7 5 t 2 , 
S42 = -S51 = iA*7 5 T 2 . 



(4.13) 



We now must construct ■ ■ ■ a^ 5 \ From Eq. (|2.18[) . we see that will 

be equal to af^ unless both oq^_ k and ct^_ k ^ are non-zero. As a result, since 
Si6 = Sej = for i.j ^ 6, we find 



a« = S 



101 



1 < *,i < 5. 



(4.14) 
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Next, we note that S15 and S51 are the only non-zero blocks with a row or column 
index of 5. Thus, the only a^' which differs from oq^ is 



a n — S11 — Si 5 S 55 1 S5i. 



(4.15) 



A straightforward application of the Banachiewicz identity (Eq. 15|) ) yields S 55 

_i f-rf + M 



1 55 



(4.16) 



where E k = Vk 2 + M 2 . Substituting this expression into Eq. f)4. 15[> yields 

% - ^7° + M 



(2) 

Hi 



= $ + M7° - M - (-iA 7 5T 2 ) 
= ^ + /i 7 °-M-|A| 2 



- (try - M 



(-«A7 5 r 2 ), 



{E-rf-El 



(4.17) 



where we have used the fact that r| = I and 757^ = —7^75. Meanwhile, the rest of 



the 01^ = = Sij, so that 



a 



(2) _ 



ft + fjr/° — M — \A 



2 fi-^-M 



if i,j = 1) 

else (1 < i, j < 4). 



(4.18) 



Constructing a^ 3 \ we note that a.^} and S42 are the only non-zero blocks in the 
fourth row or column of a^ 2 K As a result, we find 



(3) _ (2) _ (2) 
*22 — a 22 a 2A 



(ai 2 4 } ) 1 



(2) 
*42 ' 



Sll — (— Sl5)(S55) 1 ( — S51), 
Sll — Si 5 S 55 1 S5i, 



J 2 ) 



The rest of the a|? are equal to a)? so we find 



(2) 

1/ v ' 



r (3) = f * + M7° - M - |A| 2 gfe$E^ if U = l or i, j = 2, 
[Sy else (1 < i, j < 3). 



(4.19) 



(4.20) 



(3") 

Since CK33 = S33 is the only non-zero block in the third row/column, we have 

c4 4) =«i 3) > l<i,J <2. (4.21) 
Finally, since — a< 2i = S12 = S21 = 0, we find that the sole block of o^ 5 - 1 is 



a 



(5) 



(4) - off 



(4.22) 
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Having constructed the necessary determinants, Eq. (|2.13[) gives 



det(S) = det(af 1 ) ) det(a^) det(af 3 ) ) det(af 4 ) ) det(a^ 1 5 ) ) det(af 6 } 
= dct ( jg + ^7° - M - |A |2 



f,D 



$ - /i 7 ° - M 



2 -El 



det(^ + ^7° - M) 



x det(# - /i7° - Mf, 



(4.23) 



where the subscript of /, D on the determinant indicates that the remaining determi- 
nant is to be taken over flavor and Dirac indices. 

Thus, we have eliminated 6 of the original 48 indices, and have only the Dirac and 
flavor indices remaining. In fact, the flavor indices are now trivial, having vanished 
upon squaring the matrix ti (see Eq. (|4.17[) ). so that 



d<-( ( S i = det ( t + /i 7 ° - M - | A' 2 * M7 ° M X 



x det(# - /x 7 ° - Mf. 



det(# + M7° - Mf 



D 



(4.24) 



Computing det(# ± — M) yields 
det(# ± M7° - M) = dct 



(E ± fx) - M <r • k 

erk -(E±fi)-M 



= [-(E±nf + M 2 + (*-kfY, 

= [(E±tf-El]\ 

= {E+{E k ±v)] 2 [E~{E k Tn)]\ 

where we have used the fact that (<r • k) 2 = k 2 . 

Finally, computing the remaining determinant from Eq. (|4.24|) . we find 



(4.25) 



det ( # + M7° - M - |A| 



$ — ^7° — M 
(E-^f-El 



(E + ^)-\Af 



E-fi 



El 



IAI 



{E-nf-El 

A 1212 



El 



_ [E 2 - (E k + /i) 2 - |A| 2 ] 2 [ff 2 - (E k - /x) - 
{E-E k -nf{E + E k -nf 

Inserting Eq. (|4.25l) and (I4.26j) into Eq. (|4.24|) , we obtain the result 



(4.26) 



det(S)= E+^(E k+P Lf + \A\2 E + y/(E k -n)* + \A\* 



8 r 



E-y/(E k +ti)* + \A\ 2 \ [E- ^(E k -»f + \A\ 
x(E + E k + [if(E - E k - nf{E + E k - fif(E - E k + fif 



(4.27) 



Finally, then, we can read off the eigenenergies, which are the absolute values of the 
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roots of det(S) = 0: 



Ei — \E k + n\ (multiplicity 8), 

E2 — \E k — n\ (multiplicity 8), 

E 3 = V(^fc+M) 2 + |A| 2 (multiplicity 16), (4.28) 



Ei = y/(E k - fi) 2 + |A| 2 (multiplicity 16). 

Indeed, these are the correct eigenenergies, as reported previously by Rossner [2]. 
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